1 



Spectrometer Calibration by Expectation Maximization Method 
L. Yuan a 

a Department of Physics, 

Hampton University, Hampton, VA 23668 

Expectation Maximization (EM) algorithm is a parameter estimation method from in- 
complete observations. In this paper, an implementation of this method to the calibration 
of HKS spectrometer at Jefferson Lab is described. We show that the application of EM 
method is able to calibrate the spectrometer properly in the presence of high background 
noise, while the traditional nonlinear Least Square method fail. The preliminary results 
of HKS spectrometer calibration is presented. 

1. Introduction 

Expectation Maximization (EM) algorithm is a statistical method for parameter esti- 
mation from incomplete observations. It is an extension of the Maximum Likelihood (ML) 
method well known to physicists. This method was first proposed by A. Dempster etc in 
p. Since its introduction, this method has been used in a wide varieties of applications 
such as signal processing, medical image processing and genetics, to name a few ([2])- 

The EM method is well suited to handle problems with observations diluted by large 
amount of noise, since it is not known a priori a observation is signal or noise. That is why 
it is introduced into High Energy physics for track reconstruction in the ATLAS detector 
at LHC In the presence of track noise, the EM based tracking algorithm can 

obtain a track resolution more than two order of magnitude better than traditional Least 
Square tracking method . 

We have used the EM method in the spectrometer calibration for HKS experiment at 
Jefferson Lab. Jefferson Lab HKS experiment aims at obtaining high resolution hyper- 
nuclear spectroscopy by (e,e'K) reaction. To achieve this goal,it is essential to perform a 
proper spectrometer calibration to optimize the reconstruction resolution of the momen- 
tum and angles of scattering electrons and Kaons (0). The only high precision calibration 
method is to make use of the known masses of A,E° hyperons produced from hydrogen in 
CH2 target and the narrow width of \ 2 B hypernuclear ground state from 12 C target([6 ). 
These masses can be produced at the same spectrometer kinematics as the production of 
hypernuclei. 



2. Nonlinear Least Square Method 



Let {ir} denote the set of parameters which defines the reconstruction function. For 
example, the {77} can be a set of polynomial coefficients in the polynomial expansion of 
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reconstruction function. The task of calibration now is to find the best set of parameters 
{•7r} to optimize the reconstruction resolution. The missing mass E m of (e,e'K) reaction 
can be calculated from the focal plane measurement X iy % = 1, . . . N, % denotes each 
event number, and the reconstruction parameters {77} : 

£4 = fpQ,{7r}), (1) 

where f is a nonlinear function. The format of f can be derived from the kinematics 
equations. Xj represent the trajectories of the particles at spectrometer focal plane. 
Let AMj be the difference between the calculated mass and the known mass value from 
Particle Data Book M PDB , 

A Mi = E l m - M PDB . (2) 
Finally, we define a Chisquare as the weighted sum of squared AMj over all events: 

N N 

X 2 = J2 W&M* = «*(f (*« M) - M PDB ) 2 , (3) 
i=i i=i 

where Wi is the relative weights of A,S and ^ 2 B GS events. 

The set of parameters {77} which minimize the Chisquare function will define our opti- 
mized reconstruction function. This is a typical nonlinear Least Square (NLS) problem. 
The Chisquare function is still a complex nonlinear function and have to be minimized 
by numerical method. It is carried out by using CERNLIB Fortran program package 
LEAMAX (HI). 

In case we have a clean signal of hyperons and hypernuclear bound states (The sig- 
nal to noise (S/N) ratio better than 6:1), the NLS method works well, as is shown for 
simulated HKS data in our Arxiv paper ( u 5j). However, the HKS spectrometer setup 
detects very forward angle e' and Kaons in order to increase hypernuclear yield. Thus 
in real experiment, we see high accidental background between Kaon arm and electrons 
produced by Bremsstrahlung photons. For the real data, The S/N ratio in the missing 
mass spectrum is almost 1:1 (figHJ). Applying the NLS method to the data results in 
wrong calibration. As shown in figEl the accidental background under the ^ 2 B gs peak, 
which should be flat, now forms an artificial "bump" as a result of the NLS calibration. 
Clearly, the EM algorithm which is robust to noise observations should be used for the 
spectrometer calibration of HKS. 

3. Expectaton Maximization method 

We have two condiderations in order to implement the EM method: 

1. EM method is used for parameter estimation with incomplete observation. Taking 
advantage of this feature, we can define a variable Si, i = 1, . . . N, which denotes 
whether event i is a real signal (real coincidence) : Si — 1 or noise (accidental 
coincidence): Si = 0, although Si can not be observed by the spectrometer. Thus 
instead of minimize the chisquare in eqOJ we will minimize an energy function 
defined as 

E({S t }, {tt}) = J2& AM ? + - I) 2 ]. (4) 
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Figure 2. j^B missing mass spectrum used in the improper calibration of HKS spectrom- 
eter by nonlinear Least Square method. The shaded region is accidental backgroud. 
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where A is a cut off paramter. 

2. There is a large number of parameters in the parameter set {7r}, to aviod the cal- 
ibration process ending up in a local minimum, we introduce an annealing process 
([4 ). One requires each configuration of the system with energy E obey the Bolz- 
mann distribution at temperature T. One then minimizes the expectation value of 
the energy function at successively lower temperatures until final result at T — > 0. 

According to Bolzmann distribution, the probability for the system to have config- 
uration {Si, AMj, i = l,... N}, is: 

PaS t },{*}) = e- / " smit{w}) /Z, (5) 
where (3 is the inverse temperature j3 = 1/T and Z is the partition function: 

Z = I d{7r}e-" B « s <>'< w » (6) 

The EM algorithm can be divided into the expectation step and the maximization step. 
In the expectation step, the expectaion value of the energy function over the unobserved 
variable is calculated: 

Q({n} | {7T/}) =X>({5i},{7T})P({5i} I {7T/}) 

{St} 

"53 ^«*'» 

where P({5j} | {7r/}) is the probability function of assignment variables conditioned 
on the parameter set {7r/}, Pyi is the marginal probability function: 

Pm{{*>}) = £ ^W, {*'» = e-^A (8) 
Effective energy 

% = lE'^ + e-^). (9) 
Substitute equations El and |H] into equation UJ we can write up the expectation value as: 



^ (io) 



i=l 
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Pi/ can be interpreted as the probability that event i is a real signal. The minimization step 
is then to minimize the Q function with respect to the parameter set {7r}. Because the 
second term is independent of {"71"}, in the maximization step, we will minimize function: 

N N 

S ({tt} I {tt/}) = 5>,AM5v = ^(fpf,, {tt}) - M PDB f Pl l (11) 

i=l i=l 

with respect to {77}. is defined by equation [THl Again, we have added the relative 
weights Wi to adjust for the effect of A,S and ^ 2 B GS events in the calibration. The new 
values of the parameter is used to update the probabilities jV, and the g function is again 
minimized. Comparing with eqCH we can see that the EM algorithm in this case is nothing 
but an iteratively reweighted least-square procedure. The weights or probabilies are not 
constants now, but functions of AMj or {tt}. Example probability functions calculated 
for the HKS spectrometer calibration is shown in figHU 

The minimization of function q is also carried out by CERNLIB Fortran program pack- 
age LEAMAX. We have obtained preliminary reconstruction functions by the EM method 
described above. The preliminary missing mass spectra of A,E° from CH 2 target and hy- 
pernucleus \ 2 B from C 12 are shown in fig. Qand fig. EJoverlayed with background. 
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Figure 3. The dependences of calculated probability functions pi for A,E and ^ 2 B on mass 
differences AM 
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Figure 4. A and £ missing mass distribution produced by p(e,e'K) reaction from CH2 
target. The shaded region is accidental backgroud. 



